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We present a search for the standard model Higgs boson and a fermiophobic Higgs boson in the 
diphoton final states based on 8.2 fb~^ of pp collisions at -/s — 1.96 TeV collected with the DO 
detector at the Fermilab Tevatron Collider. No excess of data above background predictions is 
observed and upper limits at the 95% C.L. on the cross section multiplied by the branching fraction 
are set which are the most restrictive to date. A fermiophobic Higgs boson with a mass below 112.9 
GeV is excluded at the 95% C.L. 

PACS numbers: 14.80.Bn, 13.85. Rm, 14.80.Ec, 12.60.Fr 



In the standard model (SM), the Higgs boson [H] is 
the last undiscovered particle that provides crucial in- 
sights on the spontaneous breaking of the electroweak 
symmetry and the generation of mass of the weak gauge 
bosons and fermions. The constraints from the direct 
searches at the CERN e+e~ Collider (LEP) 1] and from 
the measurement of precision electroweak observables [0] 
result in a preferred range for the SM Higgs boson mass 
of 114.4 < Mh < 185 GeV at 95% C.L. Furthermore, 
the range 158 < Mh < 173 GeV is excluded at 95% 
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C.L. by the direct searches at the Fermilab Tevatron pp 
Collider |3j. These experimental constraints are derived 
assuming SM production and decay modes for the Higgs 
boson and can be substantially modified in case of signif- 
icant departures from the SM. 

At hadron coUiders the dominant production mecha- 
nisms for a light SM Higgs boson are gluon fusion (GF) 
{gg — > H), associated production with a W or Z bo- 
son {qq' -^ VH, V — W,Z), and vector boson fusion 
(VBF) {VV -^ H). At the Tevatron the most sensitive 
SM Higgs boson searches rely on the VH{H -^ bb) pro- 
cess for Mh < 125 GeV and on gg ^ H ^ W+W~ 
for Mh > 125 GeV. At CERN's Large Hadron CoUider 
(LHC), the strategy at high Mh (> 140 GeV) is similar, 
while at low Mh (< 140 GeV) the i/ — j' 77 decay mode 
becomes one of the most promising discovery channels, 
despite its small branching ratio of B{H —> 77) sa 0.2% 
for 110 < Mh < 140 GeV, owing to its clean exper- 



imental signature of a narrow resonance on top of a 
smoothly-falling background in the diphoton mass spec- 
trum. Some of the most sensitive searches for the SM 
Higgs boson involve the loop-mediated ggH and/or ^^H 
vertices, which are sensitive to new physics effects. For 
instance, the addition of a sequential fourth family of 
quarks can substantially enhance the ggH coupling, lead- 
ing to an increase in the GF production rate, while de- 
creasing B{H -^ hh) [J]. Alternatively, other models of 
electroweak symmetry breaking can involve suppressed 
couplings to some or all fermions [5|, with the extreme 
case being the fermiophobic Higgs boson (i?/) model, 
where the GF production mode is absent, decays into 
fermions are heavily suppressed, and B{Hf — >■ 77) is 
significantly enhanced. Therefore, Higgs boson searches 
in the 77 decay mode can be a sensitive probe of new 
physics models where the Higgs boson may be difhcult 
to observe in other, a priori more promising, channels. 

This Letter presents a search for a Higgs boson decay- 
ing into 77 using an inclusive diphoton sample collected 
with the DO detector in pp collisions at y/s ~ 1.96 TeV at 
the Fermilab Tevatron Collider. In this search both the 
SM and the fermiophobic Higgs boson models are consid- 
ered. The most recent searches at the Tevatron for a SM 
Higgs boson [6| or a fermiophobic Higgs boson [7| in the 
77 mode analyzed the diphoton invariant mass spectrum 
in search for a narrow resonance. This analysis represents 
a significant step forward in sensitivity by increasing the 
dataset by nearly a factor of three, as well as by ex- 
ploiting further kinematic differences between signal and 
background through a multivariate analysis technique. 

The DO detector is described in detail elsewhere Q. 
The subdetectors most relevant to this analysis are the 
central tracking system, composed of a silicon microstrip 
tracker (SMT) and a central fiber tracker (OFT) in a 2 T 
solenoidal magnetic field, the central preshower (CPS), 
and the liquid-argon and uranium sampling calorime- 
ter. The CPS is located immediately before the inner 
layer of the calorimeter and is formed by one radiation 
length of absorber followed by several layers of scintil- 
lating strips. The calorimeter consists of three sections 
housed in separate cryostats: a central section covering 
up to \r]\ « 1.1 [9j and two end calorimeters extending the 
coverage up to I77I ~ 4.2. They are divided into electro- 
magnetic (EM) and hadronic layers. The EM section of 
the calorimeter is segmented into four longitudinal layers 
with transverse segmentation of A77 x A(f> — 0.1 x 0.1 9], 
except in the third layer (EMS), where it is 0.05 x 0.05. 
The calorimeter is well suited for a precise measurement 
of electron and photon energies, providing a resolution 
of ~ 3.6% at electron and photon energies of ~ 50 GeV. 
The data used in this analysis were collected using trig- 
gers requiring at least two clusters of energy in the EM 
calorimeter and correspond to an integrated luminosity 
of 8.2fb-i [13]. 

Events are selected by requiring at least two photon 



candidates with transverse momentum px > "25 GeV 
in the central region of the calorimeter (|7/| < 1.1), for 
which the trigger requirements are close to 100% effi- 
cient. Photon candidates are selected from EM clusters 
reconstructed with a simple cone algorithm with radius 
n = ^y{Ari)^ + (A0)2 =0.2 that satisfy the following 
requirements: (i) at least 95% of the cluster energy is de- 
posited in the EM calorimeter; (ii) the calorimeter isola- 
tion variable / = [£;tot(0.4) - £;em(0.2)]/£;em(0.2) is less 
than 0.1, where i?tot(0.4) is the total energy in a cone of 
radius TZ = OA and £^em(0.2) is the EM energy in a cone 
of radius TZ = 0.2; (ui) the energy- weighted cluster width 
in EMS is consistent with an EM shower; (iv) the scalar 
sum of the px of all tracks originating from the primary 
pp interaction vertex in an annulus of 0.05 < TZ < 0.4 
around the cluster is less than 2 GeV; (v) the EM clus- 
ter is not spatially matched to tracker activity, either to 
a reconstructed track, or to a set of hits in the SMT 
and CFT consistent with that of an electron or positron 
trajectory [ll|; and (vi) the output of a photon neural 
network (Onn) [SI, II4I, combining information from a 
set of variables that are sensitive to differences between 
photons and jets in the tracker, the calorimeter and the 
CPS, is larger than 0.1. The requirement (vi) rejects ap- 
proximately 40% of the misidentified jets, while keeping 
> 98% of real photons. Finally, additional kinematic se- 
lections are applied in order to select a signal-enriched 
sample. The diphoton invariant mass, M^j, computed 
from the two highest px photons in an event, is required 
to be larger than 60 GeV. The azimuthal angle between 
the two photons, A^-y.^, is required to be larger than 0.5, 
which reduces the background from events where both 
photons originate from fragmentation, a process that is 
not well modeled in the simulation, while keeping > 99% 
of the Higgs boson signal. 

The selected data sample is contaminated by back- 
grounds of instrumental origin such as 7-fjet (7J), di- 
jet (jj) and Z/j* -> e+e~ (ZDY) production, with 
jets or electrons misidentified as photons, as well as a 
background from direct 77 production (DDP) where two 
isolated photons are produced. The normalization and 
shape of the jj and jj backgrounds, as well as the over- 
all normalization of the DDP background, are estimated 
from data. The Monte Carlo (MC) simulation is used to 
model the normalization and shape of the signal and ZDY 
background, as well as the shape of the DDP background. 
The MC samples used in this analysis are generated us- 
ing PYTHIA [1^ (for signal and ZDY) or SHERPA [ij] (for 
DDP) with CTEQ6L1 ^ parton distribution functions 
(PDFs), followed by a GEANT-based [l^] simulation of the 
DO detector. Events from randomly selected beam cross- 
ings are overlaid on the simulated events to better model 
contributions from additional pp interactions and detec- 
tor noise. The same reconstruction algorithms are used 
as on the data. Signal samples are generated separately 
for the GF, VH and VBF processes and normalized using 



TABLE I: Signal, backgrounds and data yields for Mr = 100 GeV to 150 GeV in 10 GeV intervals within the [Mr - 30 GeV, 
Mh + 30 GeV] mass window. The background yields result from a fit to the data. The uncertainties include both statistical 
and systematic contributions and take into account correlations among processes. The uncertainty on the total background 
is smaller than the sum in quadrature of the uncertainties in the individual background sources due to the anti-correlation 
resulting from the fit. 



Mh (GeV) 


100 


110 


120 


130 


140 


150 


77 (DDP) 

7i + 3J 
Zh* -^ e+e- 


6415 ± 395 

5727 ± 352 

599 ± 91 


4031 ± 286 

3819 ± 252 

517 ± 81 


2779 ± 188 

2265 ± 178 

361 ± 55 


1849 ± 139 

1506 ± 120 

141 ± 23 


1355 ± 99 
964 ± 87 
65 ± 12 


1026 ± 75 
641 ± 63 

34 ± 7 


Total background 
Data 


12741 ± 160 
12746 


8367 ± 134 
8380 


5405 ± 95 
5406 


3496 ± 77 
3500 


2384 ± 57 
2383 


1701 ± 48 
1696 


H boson signal 
Hf boson signal 


5.9 ± 0.8 
149.7 ± 13.2 


5.8 ± 0.8 
39.4 ± 3.5 


5.3 ± 0.7 
11.9 ± 1.0 


4.2 ± 0.6 
4.4 ± 0.4 


2.9 ± 0.4 
1.8 ± 0.2 


1.7 ± 0.2 
0.7 ± 0.1 



the next-to-next-to-leading order (NNLO) plus next-to- 
next-to-leading-logarithm (NNLL) theoretical cross sec- 
tions for GF and NNLO for VH and VBF processes ^- 
ligj . The Higgs boson's branching ratio predictions are 
from HDECAY ji^]. The ZDY background estimate from 
MC is normalized to the NNLO cross section 1211 . 

The 7J and jj yields are estimated with data [22| . Fol- 
lowing the final selection, a tightened Onn requirement 
{Onn > 0.75) is used to classify the events into four cat- 
egories: (i) both photons, (ii) only the highest pr (lead- 
ing) photon, (iii) only the second highest pt (trailing) 
photon, or (iv) neither of the two photons, satisfy this 
requirement. The corresponding numbers of events, af- 
ter subtracting the ZDY contribution, are denoted as (i) 
Npp, (ii) Npf, (iii) Nfp and (iv) Nff. The different efB- 
ciency of the Onn > 0.75 requirement for photons (e-y) 
and jets (cjot) is used to estimate the sample composition 
by solving a linear system of equations: 

{Npp, Npf, Nfp, Nfff = £ X {N^^, N^j,Nj^, N,,f, 

where Nj^ (Njj) is the number of 77 (jj) events and Njj 
(iVj-y) is the number of jj events with the leading (trail- 
ing) cluster as the photon. The 4x4 matrix £ contains 
the efficiency terms e-y and Cjet, parameterized as a func- 
tion of 1 77 1 for each photon candidate and estimated in 
photon and jet MC samples. We validate e.y with data of 
radiated photon from charged leptons in Z boson decays 
(Z — >• l'^l^j,l — e,^) and ejot with jet data [23[. The 
DDP normalization is determined from a fit to the final 
discriminant distribution used for hypothesis testing, ex- 
ploiting the difference in shape between signal and back- 
ground in each Mh search region. For each Mh hypoth- 
esis (between 100 and 150 GeV in steps of 2.5 GeV), the 
search region is defined to be Mh ± 30 GeV. The shape 
of the DDP background is obtained from SHERPA [14| . 
while the shapes of the 7J and jj backgrounds are ob- 
tained from independent data control samples selected 
by requiring exactly one photon or both photon candi- 
dates to satisfy Onn < O.l, respectively. Table H] shows 
the numbers of data events, expected background, and 
the expected H boson and Hf boson signals in six of the 



search regions resulting from a fit described later in this 
Letter. The estimated background composition is « 53% 
from DDP, sa 44% from jj+jj and w 3% from ZDY. 

To improve the sensitivity of the search, a total of five 
well-modeled kinematic variables are used to discrimi- 
nate between signal and background: M^^, Acjj-y^, the 
transverse momentum of the diphoton system (p^), and 
the transverse momenta of the leading and trailing pho- 
tons {p}p, p^). Figure [Tfa) shows a comparison of the 
M^-y distribution between data and the background pre- 
diction. Comparisons for other kinematic distributions 
can be found in Ref. [l3| . A boosted-decision-tree (BDT) 
technique [24] is used to build a single discriminant vari- 
able combining the information from the above five vari- 
ables. A different BDT is trained for each Mh hypoth- 
esis, separately for the SM and the fermiophobic Higgs 
boson models. In each model, the training is performed 
to discriminate between the sum of all relevant signals 
and the sum of all backgrounds. Figure [TJb) shows a 
comparison of the BDT output distribution between data 
and background prediction corresponding to the SM for 
Mh = 115 GeV. 

Systematic uncertainties affecting the normalization 
and shape of the BDT output distribution are estimated 
for both signal and backgrounds, taking into account cor- 
relations. The sources of systematic uncertainties affect- 
ing the signal and ZDY background normalizations in- 
clude the integrated luminosity (6.1%), photon identifica- 
tion efficiency for signal (3.9%) or electron misidentifica- 
tion rate for ZDY (12.7%) and theoretical cross sections 
(including scale and PDF uncertainties) for signal (GF 
(14.1%), VH (6.2%) and VBF (4.9%)) and ZDY (3.9%) 
production. The normalization uncertainty affecting the 
"fj+jj prediction is 8.4%. This uncertainty results from 
propagating the uncertainty on the Onn > 0.75 effi- 
ciency for photons (1.5%) and jets (10%) and also affects 
the shape of the 'yj+jj background at the l%-2% level 
through changes in the fractions of 7J and jj. Addi- 
tional systematic uncertainties affecting the differential 
distributions of data and MC include the relative pho- 
ton energy scale (l%-5% for signal, l%-4% for DDP), 



-O 4500 

« 4000 

O 
^3500 



« 



3000 

-2500 

C 2000 

« 

>1500 

1000 



n 


L D0, 


8.2 fb-' 


n 


L(a) 


+ Data 






DTY 


u 


-. 


y+jel 





Oil 


■jet+jet 




: U 


Z/7*~> ee 








: 


-signalxlOO (M^=115 GeV) 


^ 


80 


100 120 140 160 180 20 



lli„ (GeV) 



c 
!5 

CM 

d 

« Iff" 

c 

§ 

UJ 



10° 



10= 



10' 



10= 



10 



D0, 8.2 fb-' 
(b) +Data 

7+jet 
■jet+jet 

--^ Zly'-f ee 

-signalxSO (M^=115 GeV) 




1 -0.8-0.6-0.4-0.2 



0.2 0.4 0.6 0.8 1 

BDT output 




—Observed 
■ Expected 
|Expected±1 s.d. 
nExpected±2s.d. 



100 



130 



140 150 
Mh (GeV) 



FIG. 1: (color online), (a) Al^-y and (b) BDT output distributions for Mh = 115 GeV after the final selection comparing 
data to the background prediction. The expected H boson signal multiplied by a factor of 50 is also shown, (c) Observed and 
expected 95% C.L. upper limits on a x B relative to the SM prediction as a function of AIh- The bands correspond to the ± 1 
and ± 2 standard deviations (s.d.) around the expected limit under the background-only hypothesis. 
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FIG. 2: (color online). Observed and expected 95% C.L. 
upper limits on B{Hf — >■ 77) as a function of Mut- The defi- 
nition of the bands are the same as in Fig. [TJc). The blue line 
represents the branching ratio predictions from hdecay |20| |. 
Also displayed is the exclusion region obtained by the LEP 
Collaborations Wh. 



DDP modeling (1%-10%) and Higgs boson pT modeling 
in GF (l%-5%). Modeling uncertainties are obtained by 
changing the factorization and renormalization scales by 
a factor of two with respect to the nominal choice. 

No evidence for a signal, either in the SM or in the 
fermiophobic interpretations, is found, and the BDT dis- 
criminants are used to derive upper limits on the produc- 
tion cross section multiplied by the branching ratio for 
iJ —>■ 77 (cr X S) as a function of Mh- Limits are cal- 
culated at the 95% C.L. with the CLs modified frequen- 
tist approach using a log-likelihood ratio of the signal- 
plus-background (S+B) hypothesis to the background- 
only (B) hypothesis j25.] . Systematic uncertainties are 



taken into account by convoluting the Poisson probabil- 
ity distributions for signal and background with the cor- 
responding Gaussian distributions. The individual like- 
lihoods are maximized with respect to the DDP back- 
ground normalization as well as parameters that describe 
the systematic uncertainties [26j. This fit allows the 
determination of the normalization for the DDP back- 
ground from data and significantly reduces the impact of 
systematic uncertainties on the overall sensitivity. 

The resulting upper limits on a y.B relative to the SM 
prediction as a function of Mh are shown in Fig. [IJc), 
representing the most constraining results for a SM Higgs 
boson decaying into photons. Upper limits on B{Hf — >• 
77) as a function of Mh^ are presented in Fig.[2]and com- 
pared to the LEP result |23 . The sensitivity is improved 
by about a factor of two relative to previous searches at 
the Tevatron [7|, yielding the most stringent limits on 
a fermiophobic Higgs boson of Mhj > 112.9 GeV at 
95% C.L. 



Supplementary material is provided in 12|. 
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FIG. 3: Normalized Onn spectrum for photons from radiative Z boson decays, photons from diphoton MC and jets from dijet 
MC. The Ojvjv distribution has strong discriminanting power between photons and jets, and the performances in photon data 
and MC are consistent. 



TABLE II: Cross sections for the different Higgs boson production mechanisms and branching fractions for the SM Higgs 
boson decays into two photons {B{H — >■ 77)) and for fermiophobic Higgs boson decays into two photons {B{Hf — >■ 77)) as a 
function of Mr- 



Mh (GeV) 



>g (fb) 



awH (fb) 



CZH (fb) 



avBF (fb) 



B(H -^ 77) (%) 



BjHf ^ 77) (%) 



100 


1821.8 


291.9 


168.9 


100.1 


0.159 


105 


1584.7 


248.4 


145.9 


92.3 


0.178 


110 


1385.0 


212.0 


125.7 


85.1 


0.197 


115 


1215.9 


174.5 


103.9 


78.6 


0.213 


120 


1072.3 


150.1 


90.2 


72.7 


0.225 


125 


949.3 


129.5 


78.5 


67.1 


0.230 


130 


842.9 


112.0 


68.5 


62.1 


0.226 


135 


750.8 


97.2 


60.0 


57.5 


0.214 


140 


670.6 


84.6 


52.7 


53.2 


0.194 


145 


600.6 


73.7 


46.3 


49.4 


0.168 


150 


539.1 


64.4 


40.8 


45.8 


0.137 



18.46 
10.42 
6.027 
3.658 
2.334 
1.556 
1.073 
0.7586 
0.5441 
0.3902 
0.2733 



TABLE III: Expected and observed upper limits at 95% C.L. on a x B {H 
for a SM Higgs boson as a function of Mr- 



77) and on the ratio relative to the SM prediction 



Mh (GeV) 
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X B (fb) 
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FIG. 4: Variables used as input to the BDT: (a, b) Af^^, (c, d) p^ and (e, f) Ac/i^,^, in linear scale (left column) and logarithmic 
scale (right column). Data are compared to the background prediction. Also shown is the expected signal for a SM Higgs boson 
{Mh = 115 GeV) and a fermiophobic Higgs boson {Mhi ~ 115 GeV) both multiplied by a factor of 50. For the fermiophobic 
Higgs boson, the GF production is absent and the diphoton system is on average more boosted. 
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FIG. 5: Variables used as input to the BDT: (a, b) leading photon pr and (c, d) trailing photon pr, in linear scale (left column) 
and logarithmic scale (right column). Data are compared to the background prediction. Also shown is the expected signal for 
a SM Higgs boson {Mh = 115 GeV) and a fermiophobic Higgs boson {Mh, ~ 115 GeV) both multiplied by a factor of 50. For 
the fermiophobic Higgs boson, the GF production is absent and the diphoton system is on average more boosted. 



TABLE IV: Expected and observed upper limits at 95% C.L. on the cross section times branching ratio for Hf — >■ 77 (cr x B) 
and on the branching ratio (B) for a fermiophobic Higgs boson as a function of Higgs boson mass. 
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FIG. 6; SM Higgs search: (a) Distributions of the BDT output after the final selection comparing data to the background 
prediction. The expected signal multiplied by a factor of 50 is also shown for Mh ~ 115 GeV. (b) Observed log-likelihood 
ratio (LLR) as a function of Mh compared to the expected LLR for the background-only hypothesis and signal-l-background 
hypothesis. The bands correspond to the ± 1 and ± 2 standard deviations (s.d.) around the expected LLR for the background- 
only hypothesis, (c) BDT dependence of the difi^erence between data and expected background for Mh ~ 115 GeV. The 
expected signal is normalized to the observed limit on a x B. The bands represent the post-fit systematic uncertainties. 
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FIG. 7: Fermiophobic Higgs search: (a) Distributions of the BDT output after final selection comparing data to the background 
prediction. The expected signal multiplied by a factor of 50 is shown for Mhi ~ 115 GeV. (b) Observed log-likelihood ratio 
(LLR) as a function of Mhi compared to the expected LLRs for the background-only hypothesis and signal-l-background 
hypothesis. The bands correspond to the ± 1 and ± 2 standard deviations (s.d.) around the expected LLR for the background- 
only hypothesis, (c) BDT dependence of the difference between data and expected background for Mhi ~ 115 GeV. The 
expected signal is normalized to the observed limit on a x B. The bands represent the post-fit systematic uncertainties. 



